AD  606527 


1700  MAIN  ST 


SANTA MONICA • CALIFORNIA




P-1160

8-26-57


STATISTICAL DECISION THEORY AS A GUIDE TO
INFORMATION PROCESSING

by 

Harvey M. Wagner*

Economists, statisticians, and practitioners of operations
research frequently meet nearly identical problems in their
respective studies. Once the similarities are recognized,
the solutions advanced by one group of professionals often turn
out to be useful to others in different disciplines. The
belief expressed here is that statistical decision theory
provides both an enlightening and a unifying approach to
problems concerned with decision making in the face of uncertainty.
As will be pointed out subsequently, statistical decision
theory is by no means the last word on such problems, at least
at its present state of development, but the approach seems to
ask the right questions and accurately pinpoints the areas of
difficulty.


* This paper was written while I was affiliated with
Stanford University and presented at the Data Processing and
Management Information Conference, Massachusetts Institute of
Technology, July 15-[illegible], 1957. The author owes more than the
usual debt of gratitude to Professors Herman Chernoff and
Lincoln Moses, Stanford University, for permission to read
their forthcoming book on decision theory.


INTRODUCTION


The advance which decision theory makes over previous
methods in mathematical statistics is that the economic
consequences of an action are explicitly taken into account. In
other words, the theory goes beyond statements about
probabilities of making various errors, and incorporates both the
relative losses from such errors and the costs of
processing information in order to reduce the likelihood of
mistakes. One important consequence claimed by decision
theorists is that by such analysis it is possible to unify
various subfields in statistics into a single conceptual
framework. For the moment we shall refrain from stating the
alleged disadvantages of the theory.

The general problem of decision making, whether studied
by a statistician, an economist, or an operations researcher,
can conveniently be stated as follows: The decision maker has
to choose some course of action out of several open to him.^1
Such an action may pertain to an existing state of affairs or
to future events; in any case, the decision maker does not know
what the true state really is, and hence he has to choose an
action under conditions of uncertainty. The economic
consequences of the situation are a joint function of the action


^1 We shall not be concerned with the organizational or team
problems of decision making. We assume that the individual,
team, organization, etc., all have identical goals.




taken  and  the  true  but,  at  present,  unknown  state  of  affairs. 

It is useful to think of this situation as a game played by
Nature, who chooses the underlying state, and the Statistician
(or the decision maker), who selects an action. Usually the
Statistician, by means of relatively costly data processing,
is able to obtain some information about the strategy Nature
has selected. The Statistician must balance the costs of data
processing against the costs of making mistakes at a frequency
which could potentially be lowered if more information were
available. The data conceivably available to the Statistician
may or may not be able to give complete information as to
Nature's strategy.

The above formulation applies easily to the case of a
management group making some decision about the company's sales,
production, or investment policies by "sampling" information.
The cost of sampling of course may include the use of an
electronic computer as well as the expense of collecting data.
Consequently a wide variety of data processing problems may
potentially be handled by decision theory techniques.

OUTLINE OF A DECISION THEORY PROBLEM
Statistical decision theory, not unlike schools of thought
in economics, mathematics, or philosophy, is based on a system
of axioms. These postulates are far from inconsequential, but
space limitations prohibit a lengthy discussion of the axioms.
Briefly, their main implication is that it is possible to
assign numerical values to the joint result of the
Statistician's and Nature's strategies; these numerical values are what




we have been calling the "economic consequences" of the final
situation from the point of view of the Statistician. Further,
given Nature's choice, if one action results in a numerical
value of 10, say, and another action results in a numerical
value of 20, then the combined "action" of flipping a fair
coin, so that on heads the first action is taken, and on tails
the second action is taken, has the numerical value of the
arithmetical average 1/2 × 10 + 1/2 × 20 = 15. In most elementary
presentations of the theory of games, the numerical value is
usually assumed to be the monetary consequence or payoff of the
situation. Such an additional assumption may or may not be
tenable in a particular case; but in any event, decision theory
assumes that some numerical indicator of preference for various
situations is available^2 and that it is meaningful to take
probability averages of these numbers in evaluating the
relative merit of different combinations of uncertain outcomes.

For expositional purposes, assume that Nature and the
Statistician have a finite number of simple alternatives;
Nature's choices are N_1, N_2, ..., N_i, ..., and the Statistician's
actions are a_1, a_2, ..., a_j, .... The Statistician's numerical
indicator of the outcome of Nature's selecting N_i and his taking
action a_j is denoted as u(N_i, a_j). The entire set of
consequences can be displayed in matrix form, Exhibit 1.


^2 In the technical literature the numerical indicator is
called the Statistician's utility function.




Suppose that the Statistician has the opportunity of
performing a single costless experiment. The experiment may be
complicated and may offer a variety of bits of information, but
assume that the outcome of the experiment can be summarized by
a "vector" symbol z_k and that there are only a finite number
of z_k. For example, one experiment might be a yes-no
questionnaire; in this event z_k would be a "vector" of information
yielding the number of yes answers to the first question, to
the second question, ..., to the n-th question.

By assumption, the data z_k are related to N_i. More
precisely, suppose that, given any N_i, the probability of
observing z_k is known, which is denoted as the "conditional"
probability p(z_k | N_i). Once again the conditional
probabilities can be arrayed by means of a table, Exhibit 2. Each row
in the matrix indicates the conditional probability of
observing every z_k given that N_i is the true state of nature.

Next we define the notion of a simple strategy for the
Statistician. Recall that the Statistician may observe any
z_k and accordingly take any action a_j. Conceptually all
possible simple strategies available can be formulated by
listing all combinations of actions associated with
observations, Exhibit 3. Each row in the matrix is a simple strategy,
which specifies the action to be taken if a z_k is observed.
Altogether the number of simple strategies is:

    (number of actions)^(number of possible observations)



For example, if there are two actions and five possible
observations, then there are 2^5 = 32 simple strategies.
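The count can be checked by brute-force enumeration. The following is a minimal Python sketch, not part of the original paper; the function name `simple_strategies` and the labels a1, z1, ... are illustrative assumptions.

```python
from itertools import product

def simple_strategies(actions, observations):
    """Return every simple strategy as a dict mapping observation -> action."""
    return [dict(zip(observations, choice))
            for choice in product(actions, repeat=len(observations))]

# Hypothetical labels: two actions and five possible observations.
actions = ["a1", "a2"]
observations = ["z1", "z2", "z3", "z4", "z5"]

strategies = simple_strategies(actions, observations)
print(len(strategies))  # (number of actions) ** (number of observations)
```

Each element of `strategies` corresponds to one row of a table laid out like Exhibit 3.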

From Exhibits 2 and 3 we are able to construct a matrix
for each strategy s_h, which yields the probability p(a_j | N_i, s_h)
of taking a particular action a_j, given Nature's N_i and
strategy s_h, Exhibit 4.

Finally Exhibits 4 and 1 are combined to produce a table
showing the expected or average numerical values for each pair
of strategies. Since for a particular strategy Exhibit 4
gives the probability of taking an action for each state of
nature, and since Exhibit 1 contains the numerical consequences
associated with each action and state of nature, we average the
numerical outcomes and enter them in Exhibit 5 as

    U(N_i, s_h) = \sum_j p(a_j | N_i, s_h) u(N_i, a_j).


Exhibit 5 completely embodies the problem as defined. It
shows all the simple strategies open to the Statistician and to
Nature. In addition to these simple strategies, each player
can also elect to "randomize" between the simple strategies,
i.e., to select each simple strategy according to a certain
probability.
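The chain from Exhibit 1 through Exhibit 5 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the utilities, the conditional probabilities, and all names (`u`, `p_z`, `action_probs`, `expected_value`) are hypothetical and do not come from the paper.

```python
from itertools import product

# Hypothetical Exhibit 1: u(N_i, a_j).
u = {("N1", "a1"): 10, ("N1", "a2"): 4,
     ("N2", "a1"): 0,  ("N2", "a2"): 6}
# Hypothetical Exhibit 2: p(z_k | N_i); each row sums to 1.
p_z = {"N1": {"z1": 0.8, "z2": 0.2},
       "N2": {"z1": 0.3, "z2": 0.7}}
actions, observations = ["a1", "a2"], ["z1", "z2"]

# Exhibit 3: every simple strategy (observation -> action).
strategies = [dict(zip(observations, c))
              for c in product(actions, repeat=len(observations))]

def action_probs(state, strategy):
    """Exhibit 4: p(a_j | N_i, s_h), summing p(z_k | N_i) over the z_k mapped to a_j."""
    probs = {a: 0.0 for a in actions}
    for z, a in strategy.items():
        probs[a] += p_z[state][z]
    return probs

def expected_value(state, strategy):
    """Exhibit 5: U(N_i, s_h) = sum_j p(a_j | N_i, s_h) * u(N_i, a_j)."""
    return sum(p * u[(state, a)]
               for a, p in action_probs(state, strategy).items())

exhibit5 = [{state: expected_value(state, s) for state in p_z}
            for s in strategies]
```

Randomized strategies are not enumerated here; their values are probability-weighted averages of the rows of `exhibit5`.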

It is now appropriate to discuss the difficult topic of
what is a good strategy for the Statistician. It should be
stated at the outset that this is a debatable subject, and
various alternative suggestions have been put forth. Only a
few of them will be briefly explained; Blackwell and Girshick,
and Savage contain more complete treatments.^3 One proposal,




based on a "play safe" notion, is to ignore the data and pick
a "minimax" action which protects against the worst possible
selection of N_i by Nature. Using either Exhibit 1 or those
strategies in Exhibit 5 which ignore z_k (i.e., pick the same
a_j for all z_k), determine the worst numerical outcome that may
arise with the selection of an a_j; then choose that particular
a_j which assures the best out of the "worst" values previously
found.

An extension of the above procedure is to use all the
strategies in Exhibit 5, and to select the "minimax" from these
strategies, now specifically allowing probability mixtures or
randomization between strategies, if desirable. The numerical
value associated with such a generalized minimax strategy is
usually an average value of the U(N_i, s_h) components in
Exhibit 5, which in turn are averages derived from Exhibits 1
and 4.

If the Statistician has some a priori information (say,
from past relevant experience and data) that Nature selects
N_i with probability w_i, then the s_h such that \sum_i w_i U(N_i, s_h)
is maximized defines an optimal selection, which is called a
Bayes strategy. Even if a priori probabilities about Nature
are not known, it is clear that the Statistician should
consider only strategies which are at least optimal for some set


^3 See the Bibliography.




of a priori probabilities. This class of strategies, which
here will be called the admissible strategies, is usually
considerably narrower than all the simple and mixed strategies
implied by Exhibit 5. Interestingly enough, for each possible
set of a priori probabilities over the N_i, there is at least
one simple strategy s_h which is optimal for the Statistician.
In special cases it is possible by appealing to "likelihood
ratio" manipulations to determine all the admissible
strategies rather easily without the complete enumeration of
Exhibits 3 and 5.

The general framework of a statistical game may now be
summarized: The Statistician and Nature are the two players,
each with certain possible strategies or actions; there is a
determinate economic evaluation for the Statistician depending
on the outcome of both players' selection of strategies; it is
possible for the Statistician to perform experiments and
observe information pertaining to Nature's choice of a
strategy; out of all possible strategies for the Statistician,
attention is confined to the class of admissible strategies,^4
i.e., a strategy which is Bayes for at least some a priori
probabilities for Nature; it can be shown that one such


^4 In mathematical statistics there is a fine distinction
between the classes of admissible strategies and of Bayes
strategies; further, in special games no admissible strategies
may exist; but we shall not be concerned with such technical
matters in this paper.




admissible strategy is that associated with the minimax average
numerical evaluation, and which may be a good strategy if the
Statistician has no a priori information about Nature. In the
next section we introduce the cost of sampling, which
previously we have ignored.

A  CLOSER  LOOK  AT  THE  DATA  PROCESSING  OPERATION 

The effects of experimentation will now be more carefully
examined to demonstrate an efficient method of extracting
information out of the sample data and to delineate the
economic consequences of obtaining different amounts of costly
information.

Although the conceptual framework advanced above is
complete, the extent of enumeration of simple strategies
needed to accomplish the analysis, even for ordinary sized
problems, may be overwhelming if some shortcuts are not
available; furthermore, much of the effort expended in the
exhaustive approach is on strategies which turn out to be
inadmissible. Fortunately probability theory permits certain
important simplifications in the procedures previously
outlined.

In the case where no experimental data exist but a priori
probabilities for N_i are available, it has been stated that
with Exhibit 1 the probability averages over the different
u(N_i, a_j) for each action a_j would be calculated, and the
correct action would be the one yielding the highest average.

If some experimental data do exist, the procedure outlined



for Exhibit 5 may be equivalently performed by using the
experimental data to transform the a priori probabilities into what
are called a posteriori probabilities; the latter probabilities
are then applied to the entries in Exhibit 1 just as the
a priori probabilities would be applied in the no experimental
data case.

From Exhibit 2 the conditional probability p(z_k | N_i) of
observing z_k given N_i is known, and w_i denotes the a priori
probability of N_i. As defined by probability theory^5

    p(z_k | N_i) = p(z_k and N_i) / w_i.

In other words, the conditional probability of z_k, given
N_i, is equal to the joint probability of both z_k and N_i
occurring divided by the a priori probability of N_i.
Rearranging terms gives

    w_i p(z_k | N_i) = p(z_k and N_i).

The event of observing z_k is the "sum" of the mutually
exclusive and completely exhaustive events of obtaining z_k
when N_1 is the true state of nature, z_k when N_2 is the true
state of nature, ..., z_k when N_i is the true state of nature,
etc. In probability terms

    p(z_k) = p(z_k and N_1) + p(z_k and N_2) + ... + p(z_k and N_i) + ...


^5 W. J. Dixon and F. J. Massey, Jr., Introduction to
Statistical Analysis, McGraw-Hill, New York, 1957, pp. 332-333;
A. M. Mood, Introduction to the Theory of Statistics,
McGraw-Hill, New York, 1950, pp. 2o-30.




Therefore the above formulas are combined to derive the
a posteriori probability of N_i given z_k:

    p(N_i | z_k) = p(z_k and N_i) / p(z_k), by definition

                 = w_i p(z_k | N_i) / [w_1 p(z_k | N_1) + w_2 p(z_k | N_2) + ...],


which numerically is w_i transformed to an a posteriori
probability by multiplying by an appropriate factor that is a
function of the actual observed z_k. It can be proved that the
Bayes procedure as outlined with Exhibits 1-5 is equivalent to
the procedure of applying the a posteriori probabilities
p(N_i | z_k) to Exhibit 1. It can also be shown that if successive
experiments are performed, e.g., if the information in the vector z_k
is actually gotten single experiment by experiment, then the
correct procedure is continually to "revise" or to "update" the
a posteriori probabilities using the information gained from
the new experimental data.
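The revision rule can be sketched in Python as follows. The numbers and names are hypothetical, and the comparison with a single combined update assumes that the two experiments are conditionally independent given the state of nature, an assumption the paper does not state explicitly.

```python
def posterior(prior, likelihood):
    """Return p(N_i | z) from prior w_i and likelihood p(z | N_i) of the observed z."""
    joint = [w * l for w, l in zip(prior, likelihood)]   # p(z and N_i)
    total = sum(joint)                                   # p(z)
    return [j / total for j in joint]

# Hypothetical a priori probabilities and likelihoods of two observed results.
prior = [0.5, 0.5]
lik_first = [0.8, 0.3]    # p(first result  | N_1), p(first result  | N_2)
lik_second = [0.6, 0.9]   # p(second result | N_1), p(second result | N_2)

# Revise experiment by experiment ...
after_first = posterior(prior, lik_first)
after_second = posterior(after_first, lik_second)

# ... or process both results at once (valid under conditional independence).
combined = posterior(prior, [a * b for a, b in zip(lik_first, lik_second)])
```

Under the stated assumption, `after_second` and `combined` agree up to floating-point rounding, which is the sense in which yesterday's a posteriori probabilities serve as today's a priori probabilities.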

Is the suggested procedure a shortcut? Recall in Exhibit 3
it was necessary to construct a complete listing of every
possible strategy; the number of such strategies depended on
the number of all possible z_k which could be observed. The
shortcut is that a Bayes procedure need only call for certain
computations utilizing an actually observed z_k; therefore in
practice it is not necessary to list all strategies taking
into account any eventuality, but rather to make computations
based on the particular result of the experiment. Analogous
reasoning applies to the results from a sequence of experiments.




We finally come to the important point of when costly
experimentation or data processing should cease and an action
be taken. The case of a sequential sampling procedure is
discussed here; the simpler case of a fixed sample size plan
is examined in the following section. The mathematical
condition for the correct stopping place in a sequential game is
well defined. The analysis, which is closely related to
Bellman's principle of optimality in dynamic programming, is
as follows: If a decision is made at the end of some stage of
experimentation, the numerical value for the Bayes procedure
is found from an average of the a posteriori probabilities
and the entries in Exhibit 1. If further experimentation is
undertaken, the result will be a random variable, and new a
posteriori probabilities will be derived. After an additional
observation is processed, a similar calculation is once again
made whether further sampling should follow or an action be
taken. Because the outcome of an additional observation is a
random variable, the decision of what to do next will also be
random. The process is repeated until further sampling is
uneconomical.

Whenever inspection continues, the cost of making each
experiment, reckoned in numerical values consistent with those
in Exhibit 1, must be subtracted in order to arrive at the net
valuation of further experimentation. Usually more information
about Nature's strategy will increase the expected Bayes
average valuation. The question is whether the increment in




economic value of more data is offset by the cost of obtaining
it. Since the experimental results are random variables, at
each stage of the analysis a complicated procedure is needed
for computing averages reflecting the valuation of some
particular overall sampling strategy. The final decision about a
new experiment rests on a comparison of the present a posteriori
Bayes average value and the net expected value if another
experiment is achieved and the Statistician acts optimally
thereafter. As Blackwell and Girshick have demonstrated, in
certain special cases (analogous to elementary cases in
sequential analysis) the operating procedures for a "sequential
statistical game" are fairly simple. In general, a computing
procedure for solving such problems is very complex.
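The stop-or-continue comparison can be sketched as a finite-horizon recursion. This is a simplified illustration, not the paper's procedure: it works with losses to be minimized, caps the number of further observations at `horizon`, charges a constant `cost` per observation, and treats observations as independent draws given the state. All numbers and names are hypothetical.

```python
def stop_loss(w, loss):
    """Smallest expected loss from acting now; loss[i][j] is the loss of action j in state i."""
    n_states, n_actions = len(loss), len(loss[0])
    return min(sum(w[i] * loss[i][j] for i in range(n_states))
               for j in range(n_actions))

def value(w, loss, lik, cost, horizon):
    """Minimum expected loss when at most `horizon` more observations may be bought.

    lik[i][k] is p(z_k | N_i); each observation costs `cost`.
    """
    best_now = stop_loss(w, loss)
    if horizon == 0:
        return best_now
    continue_value = cost
    for k in range(len(lik[0])):
        joint = [w[i] * lik[i][k] for i in range(len(w))]
        p_zk = sum(joint)                       # probability of observing z_k
        if p_zk > 0:
            post = [x / p_zk for x in joint]    # a posteriori probabilities
            continue_value += p_zk * value(post, loss, lik, cost, horizon - 1)
    return min(best_now, continue_value)

# Hypothetical two-state, two-action, two-observation problem.
loss = [[0.0, 10.0], [20.0, 0.0]]
lik = [[0.8, 0.2], [0.3, 0.7]]
w = [0.5, 0.5]
print(value(w, loss, lik, cost=0.5, horizon=0))
print(value(w, loss, lik, cost=0.5, horizon=1))
```

Sampling is worthwhile at a given stage exactly when the continuation value falls below the loss from acting immediately; the recursion grows exponentially with the horizon, which echoes the remark that the general computation is very complex.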

AN ILLUSTRATION IN QUALITY CONTROL
An application in the area of quality control will serve
to illustrate the decision theory technique.^6 Small lots of a
complex assembly item are to be subjected to an acceptance
sampling procedure. It is known from experience that the
number of defects per item occurs according to a Poisson
probability distribution; and for the sake of simplicity, it
is postulated here that Nature "produces" lots after selecting


^6 For a challenging presentation of quality control applied
to data processing problems of an accounting nature, see
L. L. Vance and J. Neter, Statistical Sampling for Auditors
and Accountants, Wiley, New York, 1956.




a Poisson distribution with an average of either 10 or 20
defects per 100 items.^7 In the former case, the lots are
acceptable, and in the latter case unacceptable. Exhibit 6
contains the Statistician's payoff matrix. In this example,
instead of representing losses as negative numbers employed
in a maximizing operation, they are treated as positive
numbers, and strategies which minimize loss are to be
investigated. It is assumed that these monetary outcomes are good
approximations to the Statistician's "utilities."

The minimax strategy for the Statistician, if he does no
sampling, is to select a_1 with probability 1/3 and a_2 with
probability 2/3. The expected value of the outcome, $6.66,
is then independent of Nature's strategy. If the a priori
probability w_1 = 3/4 and w_2 = 1/4, then a_1 is the optimal
action, giving an expected value of $5.00.
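These figures can be reproduced with a short Python sketch. Exhibit 6 itself is not reproduced in this copy, so the loss matrix below is an assumption inferred from the s_1 and s_8 columns of Exhibit 10 (always take a_1, always take a_2); the variable names are illustrative.

```python
from fractions import Fraction as F

# Assumed Exhibit 6 (losses in dollars), inferred from Exhibit 10.
loss = {"N1": {"a1": F(0),  "a2": F(10)},
        "N2": {"a1": F(20), "a2": F(0)}}

def mixed_loss(p_a1, state):
    """Expected loss in `state` when a1 is chosen with probability p_a1."""
    return p_a1 * loss[state]["a1"] + (1 - p_a1) * loss[state]["a2"]

# With no saddle point, the minimax mixture equalizes the loss across states.
num = loss["N2"]["a2"] - loss["N1"]["a2"]
den = loss["N1"]["a1"] - loss["N1"]["a2"] - loss["N2"]["a1"] + loss["N2"]["a2"]
p_a1 = num / den
minimax_value = mixed_loss(p_a1, "N1")

# Bayes action for the a priori probabilities w_1 = 3/4, w_2 = 1/4.
prior = {"N1": F(3, 4), "N2": F(1, 4)}
bayes_loss = {a: sum(prior[s] * loss[s][a] for s in prior) for a in ("a1", "a2")}
bayes_action = min(bayes_loss, key=bayes_loss.get)

print(p_a1, float(minimax_value), bayes_action, float(bayes_loss[bayes_action]))
```

The exact minimax value is 20/3 dollars, which the text reports truncated as $6.66.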

Although the size of a sample is a variable which should
be subject to economic analysis in a proposed statistical
procedure, assume that for various reasons only 2 items drawn
randomly out of the lot are to be inspected. The sample
observations will be classified into three categories:
z_1 = 0 defects, z_2 = 1 defect, z_3 = 2 or more defects (if two


^7 We utilize the distinction employed in quality control
of defect vs. defective. The latter is defined in terms of
the particular number of allowable defects per item.




defects are found in either or both items, inspection ceases).
The conditional probabilities for z_k are shown in Exhibit 7.^8
There are 2 possible actions and 3 possible observations;
hence 2^3 = 8 simple strategies exist, Exhibit 8. Strategy s_2,
for example, specifies selecting action a_2 if z_3 occurs, and
a_1 otherwise. If N_1 is the true state of nature, then z_3
occurs with probability .02, and consequently action a_2 is
taken with probability .02. Exhibit 9 gives the action
probabilities for each strategy.^9
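Exhibit 7 is not reproduced in this copy, but its entries can be recomputed from the Poisson postulate stated in the footnote (an average of 2q/100 defects in 2 items). The following Python sketch does so; the function name `exhibit7_row` and the dictionary layout are illustrative assumptions.

```python
from math import exp

def exhibit7_row(defects_per_100, items=2):
    """Return p(z_1), p(z_2), p(z_3) for a Poisson count with mean q * items / 100."""
    mean = defects_per_100 * items / 100
    p0 = exp(-mean)            # z_1: 0 defects
    p1 = mean * exp(-mean)     # z_2: exactly 1 defect
    return {"z1": p0, "z2": p1, "z3": 1 - p0 - p1}   # z_3: 2 or more defects

exhibit7 = {"N1": exhibit7_row(10), "N2": exhibit7_row(20)}

# Strategy that takes a_2 only when z_3 occurs: p(a_2 | N_1) is just p(z_3 | N_1).
p_a2_given_n1 = exhibit7["N1"]["z3"]

for state, row in exhibit7.items():
    print(state, {z: round(p, 2) for z, p in row.items()})
```

Rounded to two places, the rows come out as .82, .16, .02 for N_1 and .67, .27, .06 for N_2, matching the probabilities used in the a posteriori calculations later in the paper.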

Finally Exhibit 10 combines the previous matrices to give
the expected or average losses for each of the strategies. A
first glance at Exhibit 10 does not reveal which strategies,
if any, are inadmissible; a graphical analysis aids in the
process, Figure 1. The expected losses are plotted as two
coordinate points with reference to axes for N_1 and N_2. The
bottom boundary, which is the lowest convex-to-the-origin
boundary defined by strategy points, has the admissible
strategies as vertices: s_1, s_2, s_4, s_8.^10
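The graphical search for the bottom boundary can be imitated numerically by computing the lower convex hull of the strategy points. The sketch below uses the Exhibit 10 values as reconstructed from the scan (several digits are damaged in this copy, so treat them as an assumption) and a standard monotone-chain hull; the function names are illustrative.

```python
# Expected losses (under N_1, under N_2) per strategy, reconstructed from Exhibit 10.
exhibit10 = {
    "s1": (0.0, 20.0), "s2": (0.2, 18.8), "s3": (1.6, 14.6), "s4": (1.8, 13.4),
    "s5": (8.2, 6.6),  "s6": (8.4, 5.4),  "s7": (9.8, 1.2),  "s8": (10.0, 0.0),
}

def cross(o, a, b):
    """Z-component of (a - o) x (b - o); positive for a counter-clockwise turn."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

def lower_hull(points):
    """Names of the lower-convex-hull vertices, ordered by increasing N_1 loss."""
    hull = []
    for name, pt in sorted(points.items(), key=lambda kv: kv[1]):
        # Drop the last vertex while it lies on or above the chord to the new point.
        while len(hull) >= 2 and cross(points[hull[-2]], points[hull[-1]], pt) <= 0:
            hull.pop()
        hull.append(name)
    return hull

print(lower_hull(exhibit10))
```

For these points every edge of the lower hull slopes downward, so the hull vertices coincide with the strategies that are Bayes for some a priori probabilities; with other data, hull edges of non-negative slope would have to be discarded.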


^8 If the number of defects in 100 items has a Poisson
distribution with an average q, then it is postulated that the
number of defects in 2 items is distributed as a Poisson with
an average 2q/100.

^9 The mathematician states that each strategy defines a
"mapping" from the sample space to the action space.

^10 An alternative delineation of admissible strategies may
be found in J. D. Williams, The Compleat Strategyst,
McGraw-Hill, 1954, pp. 71-72.




The minimax strategy, found at the intersection of the
bottom boundary and a 45° line through the origin, is to select
s_4 with probability .46 and s_8 with probability .54. Given a
priori probabilities, the corresponding Bayes strategy is found
either by applying the probabilities to Exhibit 10, or by
finding at which of the admissible strategies it is possible
to construct a tangent line with slope -w_1/w_2. If w_1 = 3/4
and w_2 = 1/4, s_4 is the optimal strategy.

If a minimax procedure is to be employed, it has been
stated that the expected loss without any data is $6.66; with
data, the minimax expected loss becomes $6.26. Therefore, it
does not pay to take a sample of 2 items unless the sampling
cost is less than $.40.^11 In the case of w_1 = 3/4 and
w_2 = 1/4, without data the expected loss is $5.00, and with
data is $4.70. Hence with this a priori information it pays
to inspect two items only if the cost of observation is less
than $.30. Such considerations are at the heart of selecting
a single stage sample size or a sequential sampling procedure.^12
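The Bayes half of this comparison is simple arithmetic on Exhibit 10 and can be sketched in Python. The Exhibit 10 values are as reconstructed from the scan (an assumption, since several digits are damaged in this copy), and the names are illustrative.

```python
# Expected losses (under N_1, under N_2) per strategy, reconstructed from Exhibit 10.
exhibit10 = {
    "s1": (0.0, 20.0), "s2": (0.2, 18.8), "s3": (1.6, 14.6), "s4": (1.8, 13.4),
    "s5": (8.2, 6.6),  "s6": (8.4, 5.4),  "s7": (9.8, 1.2),  "s8": (10.0, 0.0),
}
prior = (0.75, 0.25)   # w_1, w_2

# A priori average loss of each simple strategy.
risk = {name: prior[0] * l1 + prior[1] * l2 for name, (l1, l2) in exhibit10.items()}

# s1 and s8 ignore the sample (always a_1, always a_2): the "no data" options.
loss_without_data = min(risk["s1"], risk["s8"])
best_strategy = min(risk, key=risk.get)
loss_with_data = risk[best_strategy]

# Largest sampling cost at which inspecting 2 items still pays.
break_even_cost = loss_without_data - loss_with_data
print(best_strategy, round(loss_with_data, 2), round(break_even_cost, 2))
```

The same subtraction, applied to the minimax values quoted in the text, gives the $.40 threshold for the minimax procedure.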


^11 This statement must be qualified if there is some value
in collecting data, say, for making a future estimate of the
a priori probabilities.

^12 As the reader may verify, increasing the sample size has
the effect of lowering the boundary line in Figure 1 toward the
origin. But the marginal value of successive observations varies
with the form of the probability distribution, the sample size,
and the a priori probabilities. Hence depending on the
aforementioned considerations and data processing costs, it may,
for example, pay to take two observations where it would not
be economical to take one.




The use of a posteriori probabilities to arrive at a
procedure identical to the admissible strategy defined in
Exhibit 10 is illustrated with w_1 = 3/4 and w_2 = 1/4, for
which s_4 is optimal.

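If z_1 is observed,

    p(N_1 | z_1) = (3/4 × .82) / (3/4 × .82 + 1/4 × .67) = .79,

so applying p(N_1 | z_1) = .79 and p(N_2 | z_1) = .21 to Exhibit 6,
a_1 is found optimal.

If z_2 is observed,

    p(N_1 | z_2) = (3/4 × .16) / (3/4 × .16 + 1/4 × .27) = .64,

p(N_2 | z_2) = .36, and a_2 is optimal.

If z_3 is observed,

    p(N_1 | z_3) = (3/4 × .02) / (3/4 × .02 + 1/4 × .06) = .50,

and a_2 is optimal.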
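The three cases can be checked mechanically. In the Python sketch below, the conditional probabilities are the rounded Poisson values for Exhibit 7 and the loss matrix is the one inferred for Exhibit 6 from Exhibit 10; both are assumptions, since neither exhibit is reproduced in this copy, and the names are illustrative.

```python
# Assumed Exhibit 6: loss[action] = (loss under N_1, loss under N_2), in dollars.
loss = {"a1": (0.0, 20.0), "a2": (10.0, 0.0)}
# Assumed Exhibit 7: (p(z | N_1), p(z | N_2)) for each observation.
exhibit7 = {"z1": (0.82, 0.67), "z2": (0.16, 0.27), "z3": (0.02, 0.06)}
prior = (0.75, 0.25)   # w_1, w_2

def posterior(z):
    """A posteriori probabilities (p(N_1 | z), p(N_2 | z))."""
    joint = [w * p for w, p in zip(prior, exhibit7[z])]
    total = sum(joint)
    return [j / total for j in joint]

def bayes_action(z):
    """Action with the smallest a posteriori expected loss after observing z."""
    post = posterior(z)
    expected = {a: sum(q * l for q, l in zip(post, row)) for a, row in loss.items()}
    return min(expected, key=expected.get)

rule = {z: bayes_action(z) for z in exhibit7}
print(rule)
```

The resulting rule takes a_1 after z_1 and a_2 after z_2 or z_3, which is the simple strategy with expected losses 1.80 and 13.40 in Exhibit 10, i.e. s_4.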


SUMMARY AND EVALUATION OF THE DECISION THEORY APPROACH
As claimed at the beginning of the paper, the statistical
decision theory approach to data processing seems to isolate
the crucial points of decision making problems. The outcome
of the decision maker's action is a function of not only what
he does but what the true state of nature is. In spite of the
difficulty of measuring economic consequences of different
situations, it seems necessary to assume some sort of economic
evaluation in order to arrive at any semblance of rationality
in a systematic approach to decision making. The decision
theory technique "automatically" weighs the different economic
considerations involved in taking actions and gathering
information.




In closing, some of the serious drawbacks which appear in
the suggested approach should be discussed. It is very
important to realize that the limitations cited below may very well
apply to any systematic method. Criticisms have been made at
several levels of analysis. One set of criticisms concerns
(a) the possibility of setting up a meaningful game in the
first place, and (b) the feasibility of placing economic
evaluations on different outcomes. The latter is partly
answered by the reply that any statistical procedure has in
it either an implicit or an explicit economic evaluation of
outcomes. It is more realistic (and courageous!) to make such
considerations explicit rather than implicit. The former
argument, for example, questions the notions and assumptions
involved in Exhibit 2. Whether the requisite probability
information is available is a factual matter to be determined
for various situations. When such information is lacking, one
should immediately be on guard in judging alternative approaches.

A second level of difficulty is the amount of mathematical
manipulation necessary to obtain an answer. This criticism
includes (a) the high level of theoretical mathematics demanded
to analyze a statistical game, (b) as a consequence, the
concentrated effort needed to attain new theoretical answers, and
(c) the difficult computations required to solve a particular
case. Persons familiar with dynamic programming will recognize
that, although the latter technique is a very powerful conceptual
mode of analysis, even modern-day high speed computers are not




economically able to apply, to particular cases, some of the
theoretical results which have been found.^13 Thus decision
theory possibly may become a helpful way of taking a first
look at a problem or checking an approximate solution.

A third level of difficulty pertains to the selection of
a good strategy. Often a priori probabilities of N_i are not
known, and correspondingly a Bayes solution is not defined.
One answer given to this criticism is that sufficient
experimentation will result in a posteriori information "swamping"
the a priori assumptions. Such an answer is hardly a convincing
defense of the approach. Statisticians, much like economists
writing in the area of "new welfare economics,"^14 have often
contented themselves with merely characterizing the class of
admissible strategies, with the view that this is the class
containing all rational strategies. But the practicing
statistician will undoubtedly want some further help on
choosing one strategy out of this class, and some indication
of how his present operating procedures compare with those
suggested by the decision theorists.


^13 S. E. Dreyfus, "Computational Aspects of Dynamic
Programming," Operations Research (5), June, 1957, 409-416.

^14 For an elementary presentation, see F. M. Bator, "The
Simple Analytics of Welfare Maximization," American Economic
Review, March, 1957, pp. 22-59.




In conclusion, the decision theory approach presents a
challenging and comprehensive way of looking at data processing
problems. Surely any alternative approach should be required
to answer the questions posed by decision theory. It remains
to be seen whether decision theory has posed all of the
essential questions, and furthermore whether it will be able
to answer those queries which already have been formulated.




BIBLIOGRAPHY 


There are numerous articles in the statistics
literature on decision theory. Several of
the important books having a bearing on the
subject are:


Bellman, R., Dynamic Programming, Princeton University
Press, Princeton, 1957.

Blackwell, D., and Girshick, M. A., Theory of Games
and Statistical Decisions, Wiley, New York, 1954.

McKinsey, J. C. C., Introduction to the Theory of
Games, McGraw-Hill, New York, 1952.

Savage, L. J., The Foundations of Statistics, Wiley,
New York, 1954.

Wald, A., Statistical Decision Functions, Wiley,
New York, 1950.


Exhibit 2 Conditional Probabilities p(z_k | N_i)



                         STATISTICIAN'S STRATEGIES

States of
Nature       s_1     s_2     s_3     s_4     s_5     s_6     s_7     s_8

N_1            0     .20    1.60    1.80    8.20    8.40    9.80   10.00
N_2        20.00   18.80   14.60   13.40    6.60    5.40    1.20       0

Exhibit 10 Average Economic Evaluation in Dollars


